Basic Statistics

Raw Counts

Name Value
Rows 336,776
Columns 42
Discrete columns 17
Continuous columns 25
All missing columns 0
Missing observations 809,170
Complete Rows 906
Total observations 14,144,592
Memory allocation 93 Mb

Percentages

Data Structure

Missing Data Profile

Univariate Distribution

Histogram

Bar Chart (with frequency)

## 6 columns ignored with more than 50 categories.
## dest: 105 categories
## tailnum: 4044 categories
## flight: 3844 categories
## time_hour: 6936 categories
## model: 128 categories
## name: 102 categories

QQ Plot

## Warning: Removed 107 rows containing non-finite values (stat_qq).
## Warning: Removed 107 rows containing non-finite values (stat_qq_line).

## Warning: Removed 1498 rows containing non-finite values (stat_qq).
## Warning: Removed 1498 rows containing non-finite values (stat_qq_line).

## Warning: Removed 132 rows containing non-finite values (stat_qq).
## Warning: Removed 132 rows containing non-finite values (stat_qq_line).

Correlation Analysis

## 3 features with more than 20 categories ignored!
## tailnum: 23 categories
## flight: 211 categories
## time_hour: 848 categories
## Warning in cor(x = structure(list(year.x = c(2013L, 2013L, 2013L, 2013L, : the standard deviation is zero

Principal Component Analysis

## 2 features with more than 50 categories ignored!
## flight: 211 categories
## time_hour: 848 categories
## Warning in plot_prcomp(data = structure(list(dest = c("ATL", "ATL", "ATL", : The following features are dropped due to zero variance:
##  * year.x
##  * tz_origin
##  * dst_origin_A
##  * tzone_origin_America.New_York
##  * dst_dest_A